Smart Paper Notes

Ultra-low-power Range Error Mitigation for Ultra-wideband Precise Localization

Simone Angarano, Francesco Salvetti, Vittorio Mazzia, Giovanni Fantin, Dario Gandini, Marcello Chiaberge

Categories: Machine Learning | Robotics

2022-09-07

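Precise localization in outdoor and indoor environments is a challenging problem that currently constitutes a significant limitation for several practical applications. Ultra-wideband (UWB) localization technology represents a valuable low-cost solution to the problem. However, non-line-of-sight (NLOS) conditions and the complexity of the specific radio environment can easily introduce a positive bias in the range measurements, resulting in highly inaccurate and unsatisfactory position estimates. In light of this, we leverage the latest advances in deep neural network optimization techniques and their implementation on ultra-low-power microcontrollers to introduce an effective range error mitigation solution that provides corrections in either NLOS or LOS conditions while consuming only a few milliwatts (mW) of power. Our extensive experiments endorse the advantages and improvements of our low-cost and power-efficient approach.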

Back-to-Bones: Rediscovering the Role of Backbones in Domain Generalization

Simone Angarano, Mauro Martini, Francesco Salvetti, Vittorio Mazzia, Marcello Chiaberge

Categories: Computer Vision | Machine Learning

2022-09-02

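Domain Generalization (DG) studies the capability of a deep learning model to generalize to distributions outside its training distribution. Over the last decade, the literature has been filled with training methodologies that claim to obtain more abstract and robust data representations to tackle domain shift. Recent research has provided a reproducible benchmark for DG, pointing out the effectiveness of naive empirical risk minimization (ERM) compared with existing algorithms. Nevertheless, researchers persist in using the same outdated feature extractors, and no attention has yet been given to the effects of different backbones. In this paper, we go back to the backbones and propose a comprehensive analysis of their intrinsic generalization capabilities, which the research community has so far ignored. We evaluate a wide variety of feature extractors, from standard residual solutions to transformer-based architectures, and find a linear correlation between large-scale single-domain classification accuracy and DG capability. Our extensive experiments show that, by adopting competitive backbones in conjunction with effective data augmentation, plain ERM outperforms recent DG solutions and achieves state-of-the-art accuracy. Moreover, our additional qualitative studies reveal that novel backbones give more similar representations to same-class samples, separating different domains in the feature space. This boost in generalization capability leaves only marginal room for DG algorithms. It suggests a new paradigm for investigating the problem, one that places backbones in the spotlight and encourages the development of consistent algorithms on top of them.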

A Deep Learning Driven Algorithmic Pipeline for Autonomous Navigation in Row-Based Crops

Simone Cerrato, Vittorio Mazzia, Francesco Salvetti, Marcello Chiaberge

Categories: Robotics | Artificial Intelligence

2021-12-07

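Expensive sensors and inefficient algorithmic pipelines significantly affect the overall cost of autonomous machines. However, affordable robotic solutions are essential for practical usage, and their financial impact constitutes a fundamental requirement for employing service robotics in most fields of application. Among these, researchers in precision agriculture strive to devise robust and cost-effective autonomous platforms in order to provide genuinely competitive large-scale solutions. In this paper, we present a complete algorithmic pipeline for autonomous navigation in row-based crops, specifically designed to cope with low-range sensors and seasonal variations. First, we build on a robust data-driven methodology to generate a viable path for the autonomous machine that covers the full extension of the crop using only the occupancy grid map of the field. Moreover, our solution leverages the latest advances in deep learning optimization techniques and synthetic data generation to provide an affordable solution that efficiently tackles the well-known unreliability and degradation of the Global Navigation Satellite System (GNSS) caused by vegetation growing inside the rows. Extensive experiments and simulations on computer-generated environments and real-world crops demonstrate the robustness and intrinsic generalizability of our methodology, which opens up the possibility of highly affordable and fully autonomous machines.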

Action Transformer: A Self-Attention Model for Short-Time Human Action Recognition

Vittorio Mazzia, Simone Angarano, Francesco Salvetti, Federico Angelini, Marcello Chiaberge

Categories: Computer Vision | Machine Learning

2021-07-01

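Deep neural networks based purely on attention have been successful across several domains, relying on minimal architectural priors from the designer. In Human Action Recognition (HAR), attention mechanisms have primarily been adopted on top of standard convolutional or recurrent layers, improving the overall generalization capability. In this work, we introduce Action Transformer (AcT), a simple, fully self-attentional architecture that consistently outperforms more elaborate networks that mix convolutional, recurrent, and attentive layers. To limit computational and energy requirements, and building on previous human action recognition research, the proposed approach exploits 2D pose representations over small temporal windows, providing a low-latency solution for accurate and effective real-time performance. Moreover, we open-source MPOSE2021, a new large-scale dataset, as an attempt to build a formal training and evaluation benchmark for real-time, short-time HAR. The proposed methodology was extensively tested on MPOSE2021 and compared with several state-of-the-art architectures, proving the effectiveness of the AcT model and laying the foundations for future work.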

Beyond SOT: It's Time to Track Multiple Generic Objects at Once

Christoph Mayer, Martin Danelljan, Ming-Hsuan Yang, Vittorio Ferrari, Luc Van Gool, Alina Kuznetsova

Categories: Computer Vision

2022-12-22

Generic Object Tracking (GOT) is the problem of tracking target objects, specified by bounding boxes in the first frame of a video. While the task has received much attention in recent decades, researchers have almost exclusively focused on the single-object setting. Multi-object GOT benefits from wider applicability, rendering it more attractive in real-world applications. We attribute the lack of research interest in this problem to the absence of suitable benchmarks. In this work, we introduce a new large-scale GOT benchmark, LaGOT, containing multiple annotated target objects per sequence. Our benchmark allows researchers to tackle key remaining challenges in GOT, aiming to increase robustness and reduce computation through joint tracking of multiple objects simultaneously. Furthermore, we propose a Transformer-based GOT tracker, TaMOs, capable of joint processing of multiple objects through shared computation. TaMOs achieves a 4x faster run-time in the case of 10 concurrent objects compared to tracking each object independently, and outperforms existing single-object trackers on our new benchmark. Finally, TaMOs achieves highly competitive results on single-object GOT datasets, setting a new state of the art on TrackingNet with a success rate AUC of 84.4%. Our benchmark, code, and trained models will be made publicly available.

Importance of Synthesizing High-quality Data for Text-to-SQL Parsing

Yiyun Zhao, Jiarong Jiang, Yiqun Hu, Wuwei Lan, Henry Zhu, Anuj Chauhan, Alexander Li, Lin Pan, Jun Wang, Chung-Wei Hang

Categories: Natural Language Processing

2022-12-17

Recently, there has been increasing interest in synthesizing data to improve downstream text-to-SQL tasks. In this paper, we first examined the existing synthesized datasets and discovered that state-of-the-art text-to-SQL algorithms did not further improve on popular benchmarks when trained with augmented synthetic data. We observed two shortcomings: illogical synthetic SQL queries from independent column sampling and arbitrary table joins. To address these issues, we propose a novel synthesis framework that incorporates key relationships from schema, imposes strong typing, and conducts schema-distance-weighted column sampling. We also adopt an intermediate representation (IR) for the SQL-to-text task to further improve the quality of the generated natural language questions. When existing powerful semantic parsers are pre-finetuned on our high-quality synthesized data, our experiments show that these models have significant accuracy boosts on popular benchmarks, including new state-of-the-art performance on Spider.
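As a rough illustration of what distance-weighted column sampling could look like, the sketch below gives columns a sampling probability that shrinks with their distance in the schema graph. The abstract does not state the actual weighting, so the geometric decay, the `decay` parameter, and the function names `distance_weights` and `sample_column` are all assumptions made for this example, not the authors' method.

```python
import random


def distance_weights(distances, decay=0.5):
    """Turn schema distances into sampling probabilities.

    `distances` maps a column name to its (hypothetical) hop distance from
    the table already in the query. The geometric decay is an assumption.
    """
    raw = {col: decay ** d for col, d in distances.items()}
    total = sum(raw.values())
    return {col: w / total for col, w in raw.items()}


def sample_column(distances, rng, decay=0.5):
    """Draw one column, favouring columns that are close in the schema."""
    weights = distance_weights(distances, decay)
    cols = list(weights)
    return rng.choices(cols, weights=[weights[c] for c in cols], k=1)[0]


# Hypothetical schema: column names and distances are illustrative only.
example = {"orders.id": 0, "customers.name": 1, "regions.code": 2}
picked = sample_column(example, random.Random(0))
```

Compared with independent uniform sampling, a scheme of this kind makes it less likely that a generated query joins columns from unrelated tables.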

On the Energy and Communication Efficiency Tradeoffs in Federated and Multi-Task Learning

Stefano Savazzi, Vittorio Rampa, Sanaz Kianoush, Mehdi Bennis

Categories: Machine Learning

2022-12-02

Recent advances in Federated Learning (FL) have paved the way towards the design of novel strategies for solving multiple learning tasks simultaneously, by leveraging cooperation among networked devices. Multi-Task Learning (MTL) exploits relevant commonalities across tasks to improve efficiency compared with traditional transfer learning approaches. By learning multiple tasks jointly, significant reduction in terms of energy footprints can be obtained. This article provides a first look into the energy costs of MTL processes driven by the Model-Agnostic Meta-Learning (MAML) paradigm and implemented in distributed wireless networks. The paper targets a clustered multi-task network setup where autonomous agents learn different but related tasks. The MTL process is carried out in two stages: the optimization of a meta-model that can be quickly adapted to learn new tasks, and a task-specific model adaptation stage where the learned meta-model is transferred to agents and tailored for a specific task. This work analyzes the main factors that influence the MTL energy balance by considering a multi-task Reinforcement Learning (RL) setup in a robotized environment. Results show that the MAML method can reduce the energy bill by at least 2 times compared with traditional approaches without inductive transfer. Moreover, it is shown that the optimal energy balance in wireless networks depends on uplink/downlink and sidelink communication efficiencies.

Novel Chapter Abstractive Summarization using Spinal Tree Aware Sub-Sentential Content Selection

Hardy Hardy, Miguel Ballesteros, Faisal Ladhak, Muhammad Khalifa, Vittorio Castelli, Kathleen McKeown

Categories: Natural Language Processing

2022-11-09

Summarizing novel chapters is a difficult task due to the input length and the fact that sentences that appear in the desired summaries draw content from multiple places throughout the chapter. We present a pipelined extractive-abstractive approach where the extractive step filters the content that is passed to the abstractive component. Extremely lengthy input also results in a highly skewed dataset towards negative instances for extractive summarization; we thus adopt a margin ranking loss for extraction to encourage separation between positive and negative examples. Our extraction component operates at the constituent level; our approach to this problem enriches the text with spinal tree information which provides syntactic context (in the form of constituents) to the extraction model. We show an improvement of 3.71 Rouge-1 points over best results reported in prior work on an existing novel chapter dataset.
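For readers unfamiliar with the margin ranking loss mentioned above, here is a minimal sketch of the standard pairwise form, which is zero once a positive example outscores a negative one by at least the margin. The function names, the margin value, and the all-pairs averaging are assumptions for illustration; the abstract does not specify how the authors pair or weight examples.

```python
def margin_ranking_loss(pos_score, neg_score, margin=1.0):
    """Hinge-style loss: zero once pos_score exceeds neg_score by `margin`."""
    return max(0.0, margin - (pos_score - neg_score))


def mean_pairwise_loss(pos_scores, neg_scores, margin=1.0):
    """Average the loss over every (positive, negative) pair of scores.

    Averaging over all pairs is an assumption made for this sketch.
    """
    pairs = [(p, n) for p in pos_scores for n in neg_scores]
    return sum(margin_ranking_loss(p, n, margin) for p, n in pairs) / len(pairs)
```

Because the loss depends only on the score gap within each pair, it is less sensitive to the heavy skew toward negative instances than a per-example classification loss.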

Unsupervised Reward Shaping for a Robotic Sequential Picking Task from Visual Observations in a Logistics Scenario

Vittorio Giammarino

Categories: Robotics | Machine Learning

2022-09-25

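We focus on an unloading problem, typical of the logistics sector, modeled as a sequential picking task. In this type of task, modern machine learning techniques have been shown to work better than classic systems, since they are more adaptable to stochasticity and better able to cope with large uncertainties. More specifically, supervised and imitation learning have achieved outstanding results in this regard, with the shortcoming of requiring some form of supervision that is not always obtainable in all settings. Reinforcement learning (RL), on the other hand, requires a much milder form of supervision but remains impractical due to its inefficiency. In this paper, we propose and theoretically motivate a novel unsupervised reward shaping algorithm from expert observations, which relaxes the level of supervision required by the agent and works to improve RL performance in our task.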

On the Opportunities and Challenges of using Animals Videos in Reinforcement Learning

Vittorio Giammarino

Categories: Artificial Intelligence | Computer Vision | Machine Learning

2022-09-25

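We investigate the use of animal videos to improve efficiency and performance in Reinforcement Learning (RL). From a theoretical perspective, we motivate the use of weighted policy optimization for off-policy RL, describe the main challenges faced when learning from videos, and propose solutions. We test our ideas in offline and online RL and show encouraging results on a series of 2D navigation tasks.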
